Domain-Specific Ontology Mapping by Corpus-Based Semantic Similarity

Authors

  • Chin Pang Cheng
  • Gloria T. Lau
  • Jiayi Pan
  • Kincho H. Law
  • Albert Jones

Abstract

Mapping heterogeneous ontologies is usually performed manually by domain experts, or accomplished by computer programs that compare the structures of the ontologies and the linguistic semantics of their concepts. In this work, we take a different approach: we compare and map the concepts of heterogeneous domain-specific ontologies by using a document corpus, drawn from a domain similar to that of the ontologies, as a bridge. Cosine similarity and the Jaccard coefficient, two vector-based similarity measures commonly used in information retrieval, are adopted to compare semantic similarity between ontologies. Additionally, the market basket model is modified as a relatedness analysis measure for ontology mapping. We use regulations as the bridging document corpus and take the corpus's hierarchical information into account when comparing concept similarity. Preliminary results are obtained using ontologies from the architectural, engineering and construction (AEC) industry. The proposed market basket model appears to outperform the other two similarity measures, and its prediction error is further reduced when corpus structural information is used.
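
As an illustration of the two vector-based measures named in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes each ontology concept is represented by a hypothetical occurrence vector over the sections of the bridging corpus (1 if the concept is matched in that section, 0 otherwise). The `confidence` function is the standard association-rule confidence from market basket analysis, included only for comparison; the abstract does not specify how the authors modify the market basket model, so it should not be read as their measure. All names and data here are assumptions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length occurrence vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a concept that never occurs is similar to nothing
    return dot / (norm_a * norm_b)


def _sections(vector):
    """Indices of the corpus sections in which the concept occurs."""
    return {i for i, value in enumerate(vector) if value}


def jaccard_coefficient(a, b):
    """Shared sections divided by sections containing either concept."""
    set_a, set_b = _sections(a), _sections(b)
    union = set_a | set_b
    if not union:
        return 0.0
    return len(set_a & set_b) / len(union)


def confidence(a, b):
    """Standard association-rule confidence: the fraction of sections
    containing concept a that also contain concept b (asymmetric).
    This is NOT the paper's modified market basket measure."""
    set_a, set_b = _sections(a), _sections(b)
    if not set_a:
        return 0.0
    return len(set_a & set_b) / len(set_a)


# Hypothetical data: two concepts from different ontologies, matched
# against five sections of a regulation corpus.
concept_x = [1, 1, 0, 1, 0]
concept_y = [1, 0, 0, 1, 1]

print(cosine_similarity(concept_x, concept_y))
print(jaccard_coefficient(concept_x, concept_y))
print(confidence(concept_x, concept_y))
```

Cosine similarity and the Jaccard coefficient are symmetric, whereas confidence depends on which concept is taken as the antecedent.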

Similar resources

Web-Based Semantic Similarity: An Evaluation in the Biomedical Domain

Computation of semantic similarity between concepts is a very common problem in many language related tasks and knowledge domains. In the biomedical field, several approaches have been developed to deal with this issue by exploiting the structured knowledge available in domain ontologies (such as SNOMED-CT or MeSH) and specific, closed and reliable corpora (such as clinical data). However, in r...

Automated Alignment and Extraction of a Bilingual Ontology for Cross-Language Domain-Specific Applications

This paper presents a novel approach to ontology alignment and domain ontology extraction from two existing knowledge bases: WordNet and HowNet. These two knowledge bases are automatically aligned to construct a bilingual ontology based on the co-occurrence of words in a bilingual parallel corpus. The bilingual ontology achieves greater structural and semantic information coverage from these tw...

Estimating Semantic Distance Between Concepts for Semantic Heterogeneous Information Retrieval

This paper brings two contributions in relation with the semantic heterogeneous (documents composed of texts and images) information retrieval: (1) A new context-based semantic distance measure for textual data, and (2) an IR system providing a conceptual and an automatic indexing of documents by considering their heterogeneous content using a domain specific ontology. The proposed semantic dis...

Computing Knowledge-Based Semantic Similarity from the Web: An Application to the Biomedical Domain

Computation of semantic similarity between concepts is a very common problem in many language related tasks and knowledge domains. In the biomedical field, several approaches have been developed to deal with this issue by exploiting the knowledge available in domain ontologies (SNOMEDCT) and specific, closed and reliable corpuses (clinical data). However, in recent years, the enormous growth of...

A Relative Structure Similarity Method For Multiple Ontologies Alignment

Knowledge in domain is expressed with the help of ontology which is scattered all over its space. Using ontology gives a share in increasing precision. Different ontologies may represent the same domain, thus includes different terms that equivalently refer to the same meaning and vice versa. This results in different structures for ontologies. That's why it is necessary to relate concepts and ...

Publication date: 2007